A Table of Notations
Eq. (13), we have Var null Q N. Putting it together, we have R (S

Use the grouping of rows described in Step 2 to construct the block Householder quantizer.

For the ResNet18/50 models, we adopt a slightly modified version, ResNetv1.5 [

We train for 200 epochs. Due to limited device memory, we set the batch size to 50 per GPU with 8 GPUs in total, and the initial learning rate is 0.4. For both datasets, we use a cosine learning rate schedule, following [45].
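
As a rough illustration of the cosine learning rate schedule mentioned above, the following is a minimal sketch and not the authors' training code. It takes the 200 epochs and the initial learning rate of 0.4 quoted in the text as defaults. The function name `cosine_lr`, the per-epoch update granularity, the absence of warmup, and the decay to a final rate of zero (`min_lr=0.0`) are all assumptions made for illustration; the exact variant used in [45] may differ.

```python
import math


def cosine_lr(epoch, total_epochs=200, base_lr=0.4, min_lr=0.0):
    """Cosine-annealed learning rate at the start of `epoch`.

    Assumptions (not stated in the text): the rate is updated once per
    epoch, there is no warmup phase, and it decays to `min_lr` = 0.
    """
    # Fraction of training completed, clamped to [0, 1].
    progress = min(max(epoch / total_epochs, 0.0), 1.0)
    # Half-cosine interpolation from base_lr (progress=0) to min_lr (progress=1).
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


# Global batch size implied by the text, assuming plain data parallelism:
# 50 samples per GPU on 8 GPUs.
GLOBAL_BATCH_SIZE = 50 * 8

if __name__ == "__main__":
    for epoch in (0, 50, 100, 150, 200):
        print(f"epoch {epoch:3d}: lr = {cosine_lr(epoch):.4f}")
```

Under these assumptions the rate starts at 0.4, passes through half that value at the midpoint of training, and reaches zero at the final epoch. Most of the decay happens in the middle of training, with a flat region at both ends.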